Multi-component Word Sense Disambiguation

نویسندگان

Massimiliano Ciaramita

Mark Johnson

چکیده

This paper describes the system MC-WSD presented for the English Lexical Sample task. The system is based on a multicomponent architecture. It consists of one classifier with two components. One is trained on the data provided for the task. The second is trained on this data and, additionally, on an external training set extracted from the Wordnet glosses. The goal of the additional component is to lessen sparse data problems by exploiting the information encoded in the ontology.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

An Evaluation of Graded Sense Disambiguation using Word Sense Induction

Word Sense Disambiguation aims to label the sense of a word that best applies in a given context. Graded word sense disambiguation relaxes the single label assumption, allowing for multiple sense labels with varying degrees of applicability. Training multi-label classifiers for such a task requires substantial amounts of annotated data, which is currently not available. We consider an alternate...

متن کامل

رفع ابهام معنایی واژگان مبهم فارسی با مدل موضوعی LDA

Word sense disambiguation is the task of identifying the correct sense for the word in a given context among a finite set of possible sense. In this paper a model for farsi word sense disambiguation is presented. The model use two group of features: first, all word and stop words around target word and topic models as second features. We extract topics from a farsi corpus with Latent Dirichlet ...

متن کامل

DFKI: Multi-objective Optimization for the Joint Disambiguation of Entities and Nouns & Deep Verb Sense Disambiguation

We introduce an approach to word sense disambiguation and entity linking that combines a set of complementary objectives in an extensible multi-objective formalism. During disambiguation the system performs continuous optimization to find optimal probability distributions over candidate senses. Verb senses are disambiguated using a separate neural network model. Our results on noun and verb sen...

متن کامل

How Phrase Sense Disambiguation outperforms Word Sense Disambiguation for Statistical Machine Translation

We present comparative empirical evidence arguing that a generalized phrase sense disambiguation approach better improves statistical machine translation than ordinary word sense disambiguation, along with a data analysis suggesting the reasons for this. Standalone word sense disambiguation, as exemplified by the Senseval series of evaluations, typically defines the target of disambiguation as ...

متن کامل

A Hybrid Approach to Word Sense Disambiguation: Neural Clustering with Class Labeling

By combining a neural algorithm with the WordNet lexical database we were able to semi-automatically label the groups of items clustered in a multi-branched hierarchy, paving way for the use of neural algorithms together with ontological knowledge in word sense disambiguation tasks.

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره شماره

صفحات -

تاریخ انتشار 2004

Multi-component Word Sense Disambiguation

نویسندگان

چکیده

منابع مشابه

An Evaluation of Graded Sense Disambiguation using Word Sense Induction

رفع ابهام معنایی واژگان مبهم فارسی با مدل موضوعی LDA

DFKI: Multi-objective Optimization for the Joint Disambiguation of Entities and Nouns & Deep Verb Sense Disambiguation

How Phrase Sense Disambiguation outperforms Word Sense Disambiguation for Statistical Machine Translation

A Hybrid Approach to Word Sense Disambiguation: Neural Clustering with Class Labeling

عنوان ژورنال:

اشتراک گذاری